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Convergence and Tradeoff of Utility-Optimal CSMA 
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Abstract 

It has been recently suggested that in wireless networks, CSMA-based distributed MAC algorithms could achieve optimal 

■ utility without any message passing. We present the first proof of convergence of such adaptive CSMA algorithms towards an 
' arbitrarily tight approximation of utility-optimizing schedule. We also briefly discuss the tradeoff between optimality at equilibrium 
' and short-term fairness practically achieved by such algorithms. 

D 

■ I. Model and Algorithm 

A. Network model and optimization problem 

Consider a wireless network composed by a set X of L interfering links. Interference is modeled by a symmetric, boolean 
matrix A E {0, 1)^^^, where A/i = 1 link / interferes link k. Define hy N c {0, 1)^ the set of feasible link activation profiles, or, 
schedules. A schedule m 6 TV is a subset of non-interfering active links. The transmitter of link / can transmit at a fixed unit 

■ rate when active, and all links are saturated with infinite backlog. 

> : 

, Denote by 7 = (y;, I e £.) the long-term throughputs achieved by a scheduling algorithm, which determines which links 

0^ ' are activated at each time. Let U ; — > R be an increasing and strictly concave objective function. The following utiUty 
^ ■ 

! optimization problem over schedules has been extensively studied; 

o : 

max l,ie£U(yi), s.f^l e £,yi <I.,„;iemJ^m (1) 



X 



where n,,, is the long-term proportion of time when schedule m is used. We denote by 7 = (y,, / e X) the solution of ([T]). Most 
of the proposed distributed schemes to solve (1) make use of a dual decomposition of the problem into a rate control and a 
scheduUng problem: A virtual queue is associated to each link; a rate control algorithm defines the rate at which packets are sent 
to the virtual queues, and a scheduling algorithm decides, depending on the level of the virtual queues, which schedule to use 
with the aim of stabilizing all virtual queues. The main challenge reduces to developing a distributed and efficient scheduling 
algorithm. Most of the proposed solutions are semi-distributed implementations of the max-weight scheduler introduced in [1], 
and require information about the queues to be passed around among the nodes or links (e.g., see a large set of references in 
[2]). This signaling overhead increases communication complexity and reduces effective throughput of the algorithms. 

Recently, there have been proposals that do not use any message passing, and yet achieve high efficiency [3]-[5]. These 
algorithms are based on the random access protocol of CSMA (Carier Sense Multiple Access), and leverage simulated annealing 
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techniques to solve the max-weight scheduhng problem as first proposed in [6]. In this paper we provide the first rigorous 
proof that such adaptive CSMA algorithms indeed converge, using stochastic approximation tools with two time-scales. Then, 
we quantify the impact of inevitable collisions in CSMA protocols on the trade-off between their long-term efficiency and 
short-term fairness, the latter defined here as /3 - 1 / ma\{Ei, I e X}, where £/ is the average duration during which link / do not 
transmit successfully. It turns out that a price to pay for the surprisingly good utility performance by these message-passing-free 
algorithms is short-term starvation. 

B. Adaptive CSMA algorithms 

To access the channel, each transmitter I e X. runs a random back-off algorithm parametrized by two positive real numbers 
{Ai,fii), denoted as CSMA(/l/,///); after a successful transmission, the transmitter randomly picks a back-off counter according 
to some distribution of mean Ar, it decrements the counter only when the channel is sensed idle; and it starts transmitting when 
the back-off reaches 0, and remains active for a duration yU;. 

We first consider a continuous-time and then a discrete-time version of CSMA algorithms. In the ideal continuous-time 
setting (considered in this and the next sections), the back-off counter distribution is exponential, so that two interfering links 
cannot be activated simultaneously and collisions are avoided. In practice, back-offs are discrete, say geometrically distributed. 
In this discrete setting (studied in Section III), collisions may occur, degrade performance, and introduce tradeoff between 
long-term utility and short-term fairness. 

If parameters {Ai,^i),l e £. were fixed, the analysis of the dynamics of systems under continuous-time CSMA algorithms 
would be classical (e.g., [7] and references therein). In this case, in steady state the set of active links evolves according to a 
reversible Markov process whose stationary distribution, denoted by tt^'*^, is defined by: 

Vm e N,n •'^(m) = — — . (2) 

Ijm'eN 1 1/em' 

We now describe how transmitters adapt their CSMA parameters. Time is slotted and transmitters update their parameters 
at the beginning of each slot. To do so, they maintain a virtual queue, denoted by qi{t] in slot f, for link /. The algorithm 
operates as follows (the algorithm presented here is an extension of those proposed in [3]): 
Algorithm 1. 

1) During slot f, the transmitter of link / runs CSMA(/l;[f],ju), and records the amount S i{t] of service received during this 
slot; 

2) At the end of slot f, it updates its virtual queue and its CSMA parameters according to: 



W'{qi[t]) V 



qi[t+ 1] = 

and set Ai[t + I] - exp{W{qi[t + 1])). 
At the beginning of each slot, each non-active transmitter picks a new random back-off counter to account for the CSMA 
parameter update. In Algorithm 1, : N — » R is a step size function; W : — » is a strictly increasing and continuously 
differentiable function, termed weight function; V, ^™", ^™™(> ^™"') are positive parameters, and [-j^ = max(^f , min(c, •))• 



II. Convergence Proof 

The main difficulty in analyzing the convergence of Algorithm 1 lies in the fact that the updates in the virtual queues depend 
on random processes {Si[t],t > 0), whose transition rates in turn depend on the virtual queues. As we will demonstrate, it 
is possible to represent Algorithm 1 as a stochastic approximation algorithm with controlled Markov noise as introduced by 
Borkar [8]. 

We use the notation tt'' to denote the distribution on N defined by: 

w ^ q, . exp(2te„, W(q,)) 

Vm E N, n^(m) = . (3) 

exp(i;/a„' W(q,)) 

We also denote by 7[f] - (jilt] - | YJiZo S I e -0 the random variable representing the time-averaged service rates received 
by the various links over the interval [0,0- The following theorem states the convergence of Algorithm 1, for diminishing 
step-sizes (similar but weaker results are readily obtained for constant, small step-sizes). 

Theorem 1: Assume that V < W(^™^'')/?7'(1) and that XZo^i'^ - XZo^i^^^ < For ^"Y initial condition q[0], under 
Algorithm 1, we have the following convergence: 

lim q[t] - q* and lim7[f] = 7*, almost surely, 

where q* and 7* are such that: {■j*,n'^') solves: 

max yZ,e£ U (yi) - I.„,eN^„, log n„, 

s.t. r/ < 2,„eyv:m,=l^m> ^ineN^m-'^- (4) 

Furthermore, Algorithm 1 approximately solves ([T]i as: 

|2/E£(C/(r/) - U(yf))\ < log \N\/V. (5) 
Proof. We first show that the network dynamics under the continuous-time random CSMA protocol can indeed be averaged and 
it asymptotically approaches to a deterministic trajectory (see Lemma [T]). Resolving this bottleneck in understanding adaptive 
CSMA is the main contribution in this proof. Then we prove that the resulting averaged algorithm converges to the solution 
of®. 

Step 1. From the discrete-time sequence {q[t],t > 0), we define a continuous function q(-) as follows. Define for all n, 
tn = 2"=! t>[i], and for all for all t„ <t < t„+\, 

t - 1„ 

qiiO = qi[n] + (qi[n + 1] - qi[n]) X (- -). (6) 

Lemma 1 (Convergence and averaging): Denote by q the solution of the following system of o.d.e.'s: for all 

dqildt^[u'-\w,{q,)lv)-^„r^N-.nn=l7:Hm)} , (7) 

with q(T) = q{T). Then we have: for all T > 0, 

hm sup \\q{t) - q(t)\\ = a.s. (8) 

te[T,T+T] 

Lemma [T| shows that the trajectory of the continuous interpolation q of the sequence of the virtual queues q asymptotically 
approaches that of q. Note that in the limiting o.d.e.'s, the service Si received on each link is averaged (as if the virtual queues 
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were frozen), and this averaging property constitutes the key challenge in analyzing the convergence of Algorithm 1 . 

To prove Lemma[Tl we attach to each link / a variable «;[?], where «;[?] = 1 if the link is active at time t (at the end of slot 
f), and otherwise. Now it can be easily seen that Y[t] - {S[t],a[t]) is a non-homogeneous Markov chain whose transition 
kernel between times t and f + 1 depends on q[t] only (this can be checked as in [7]). Now the updates of the virtual queues 
in Algorithm 1 can be written as: 

qi[t+l]^q,[t]+b[t]xh(q,[t],Yi[t]), 

where 

h{q, Y) = --L-(U'-\W{q)m -Si)) ■ 1„„,.„<„<,„,»|. 

As a consequence, Algorithm 1 can be seen as a stochastic approximation algorithm with controlled Markov noise as defined 
in [8], [9]. To complete the proof of Lemma 1, we check the conditions as stated in [9]. 

1) The transition kernel of ^[f], parametrized by q[t], is continuous in q[t] (because the transition rates from one state to 
another are determined by the /l/[f]'s, which are continuous in the ^/[f]'s). Note also that fixing q[t] = qo, the obtained Markov 
chain Y[t] is ergodic (its state-space is finite) with stationary distribution tt''". 

2) h is continuous and Lipschitz in the first argument, uniformly in the second argument. This can be easily checked, given the 
properties of the utility and weight functions U and W and observing that we restrict our attention to the compact [(7™'",^"^"]. 

3) Stability condition: qi[t] < q'™" for all / e X and t > 0. 

4) Tightness condition (corresponding to (f) in [9][pp.71]): This is satisfied since Y[t] has a finite state-space (cf. conditions 
(6.4.1) and (6.4.2) in [9] [pp.76]). 

This completes the proof of Lemma [T] By Lemma 1, if there exists an equilibrium q* such that lim,^oo q{t) = q*, then we 
would also have: lim,^oo q[t] - q* a.s. (see [10] for details). 

Step 2. To complete the proof, we show that (|7]i may be interpreted as a sub-gradient algorithm (projected on a bounded 
interval) solving the Lagrange dual problem of (|4|. Let D{v, rf) denote the dual function of (|4]i. Then we show that (|7]i is the 
sub-gradient algorithm of: 

min D{v, rf), s.t. v™" < v, < VZ e L- (9) 

Here we include the upper-bound v™™ (resp. lower-bound v™'") that corresponds to the limitation of the ^/'s: v'™" = W{q™™) 
(resp. V™" = W(^™'")). The Lagrangian of (|4|i is given by: 

L(7, TT; l>, if) =(£/exy log y/ - V/y,) + (Stei:T'Ane//:/«, = l-^m 

log7r„0 + ri(I.,„eNJT„ - 1). 

Then the KKT conditions of (|4|i are given by: 

VU'iji) = V,, V/ € X, (10) 
-1 - log;r,„ + Z,:„„=iv/ + 77 = 0, VmeN, (11) 

V/(y; - 'L,„eN:mi = l^m) = 0. (12) 
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Now if V; - W{qi) for all /, (fTTT i is solved for itt'' (in view of ([3])). The sub-gradient of ( fT2b (when accounting for (fTOl i) is: 

dy,/flff = (c/'-'(v;/y) - 4) ■ l(v-»<v,<v-b (13) 

which is equivalent to (|7|l, provided that the solution v*, / e X, of (|9]l without the constraints v™'" < v < v™" actually belongs to 
the interval [ymm ^maxj latter condition is satisfied by simply combining y/ < 1, ( fTOl ). and the assumption y < v™''/f/'(l). 
Finally, since (|9]l is a strictly convex optimization problem, ( fT3] l converges to its unique equilibrium i/*. Finally, the inequality 
(O is obtained comparing ([Hi and Q, because entropy - 2/n log^r,,, is always bounded by log \N\. The proof of Theorem[T] 
is completed. □ 

III. Short-term fairness vs. long-term efficiency 

In the previous section, we have analyzed the convergence of Algorithm 1 in the ideal, continuous-time setting of CSMA 
protocols with no collisions. In practical implementaions however, the back-off counters are discrete and collisions possible 
[5], [11]. To keep collision rates less than e, we scale down in Algorithm 1 the transmission probabilities to e/l/[r] and scale 
up the channel holding time to fj./e. Denote by 7* the throughput vector obtained with this modified algorithm. Of course, we 
have lim^^oT* = 7*- More precisely, using standard perturbation analysis, one can show that for all I: 

y*.i = y* - Ci£ + o(e). 

The constants C; can be derived explicitly for networks with simple interference structures, but are more diflicult to obtain for 
general networks. For example, in networks with full interference, i.e., where all links interfere each other, we can easily prove 
that Ci is roughly equal to A*, which in turn is equal to /i™^" - fi^^ exp W(^™'"') (indeed, in view of (|2|, the throughput on 
any link is just equal to A* jijil + LA*fj.), an increasing function of A*). Combining the above observation and the distance 
between 7* and the utility optimal vector 7, which represents the efficiency gap of the algorithm, scales as ki/V + ekoiX*) 
when e is small, where ki is a positive constant, and k2 a constant that depends on A*). From the assumption on V in Theorem 
1, we deduce that the efficiency gap scales as ^1 / log(/l™^''//) + ek2{\*). For network with full interference, the gap scales as: 
;ti/log(.l™''ju)H-ei™\ 

Let us now evaluate the short-term fairness index. Using cycle formula [12], at the equilibrium, the average of periods, 
during which link / do not transmit successfully, is given by: 

Et^-x = - X ^ +o(e) . 

Then the short-term fairness index /3 scales as kj, ■ e//Y, where kj is a positive constant. 

Now if we want to guarantee an efficiency gap less than 6, in view of the above analysis, we must have /I""// > exp{ki/6). 
In the case networks with full interference, this further implies that: /3 < k^6/ exp{ki/6). For networks with more general 
interference structure, analytically expressing the tradeoff between efficiency and short-term fairness is more difficult, but the 
conclusion remains similar: to approach optimality in sum utility in the long-run, some node will be denied channel access 
for longer time, and we must pay a price of short-term unfairness that grows very fast (the channel holding time must grow 
hke 
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Fig. 1. Efficiency vs. short-term fairness tradeofl" in a 3-Iink linear network. Algorithm parameters: b[t] = 0.001, W(x) = x, V = I, e/i'"'"^ = 0.1. 

We illustrate this tradeoff numerically on a simple 3-link linear network, in which link 1 and link 3 both interfere with link 
2 but link 1 and link 3 do not interfere. Figure [1] shows the efficiency (i.e., 1 -5) as a function of l/(short-term fairness index). 
10 experiments were carried out with different random seeds for each value in x-axis. In the practical setting with collisions, 
85% efficiency, in terms of utility achieved, is quite good for random access without message passing, although this efficiency 
drops as short-term fairness improves exponentially. 
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